On Jordan angles and triangle inequality in Grassmannian 



Yurii A. Neretin^1
Abstract 

Let $L, M, N$ be p-dimensional subspaces in $\mathbb{R}^n$. Let $\varphi_j$ be the angles
between L and M, let $\psi_j$ be the angles between M and N, and let $\theta_j$ be
the angles between L and N. Consider the orbit of the vector $\psi \in \mathbb{R}^p$
with respect to permutations of coordinates and inversions of axes. Let
Z be the convex hull of this orbit. Then $\theta \in \varphi + Z$. We discuss similar
theorems for other symmetric spaces.

We also obtain a formula for the geodesic distance for any invariant convex
Finsler metric on a classical symmetric space.



We obtain a version of the V. B. Lidskii theorem [9] on the spectrum of a sum of
matrices for an arbitrary classical Riemannian semisimple symmetric space^2.

1. Grassmannians. Fix positive integers $p \le q$. Consider the space $\mathbb{R}^{p+q}$
equipped with the standard scalar product. Denote by $\mathrm{Gr}_{p,q}$ the set of all
p-dimensional linear subspaces in $\mathbb{R}^{p+q}$. The orthogonal group $\mathrm{O}(p+q)$ acts on
$\mathbb{R}^{p+q}$ and hence it acts on $\mathrm{Gr}_{p,q}$. Obviously,

$$\mathrm{Gr}_{p,q} = \mathrm{O}(p+q)/\mathrm{O}(p) \times \mathrm{O}(q)$$

2. Jordan angles. Let $L, M \in \mathrm{Gr}_{p,q}$. Consider an orthonormal basis
$e_1, \dots, e_p \in L$ and an orthonormal basis $f_1, \dots, f_p \in M$. Consider the matrix

$$A = A[L, M]$$

with the matrix elements $\langle e_i, f_j \rangle$. Denote by

$$\lambda_1 \ge \dots \ge \lambda_p$$

the singular values^3 of the matrix A. Obviously, the numbers

$$\lambda_j = \lambda_j[L, M]$$

do not depend on the choice of the bases $e_1, \dots, e_p \in L$ and $f_1, \dots, f_p \in M$.

Proposition 1. Let $L, M, L', M' \in \mathrm{Gr}_{p,q}$. The following conditions are
equivalent:

i) $\lambda_j[L, M] = \lambda_j[L', M']$ for all j;

ii) there exists an element $g \in \mathrm{O}(p+q)$ such that $gL = L'$, $gM = M'$.

Proof. The statement is obvious.

1 Supported by grants RFBR-98-01-00303, NWO 047-008-009.
2 The result of the paper was announced in [13].
3 The singular values of a matrix A are the eigenvalues of the matrix $\sqrt{A^* A}$.

Proposition 2. Consider $L, M \in \mathrm{Gr}_{p,q}$. There exist orthonormal bases
$e_1, \dots, e_p \in L$ and $f_1, \dots, f_p \in M$ such that

$$\langle e_i, f_j \rangle = 0 \quad \text{if } i \ne j$$

$$\langle e_j, f_j \rangle = \lambda_j$$

Proof. The statement is obvious.
Proposition 3.

a) $$\lambda_k[L,M] = \max_{P \subset L} \ \min_{v \in P,\ \|v\| = 1} \ \max_{w \in M,\ \|w\| = 1} \langle v, w \rangle$$

where the first maximum is taken over all k-dimensional subspaces P in L.

b) $$\lambda_k[L,M] = \min_{Q \subset L} \ \max_{v \in Q,\ \|v\| = 1} \ \max_{w \in M,\ \|w\| = 1} \langle v, w \rangle$$

where the minimum is taken over all subspaces Q having codimension $k - 1$ in L.

Proof. The statement is a corollary of the standard minimax characterizations
of eigenvalues and singular values, see [10], [?].

Proposition 4. Denote by $\Pi_M$ the orthogonal projector onto the subspace M.
Then the numbers $\lambda_j[L,M]$ are the singular values of the operator

$$\Pi_M : L \to M$$

Proof. The statement is obvious.

Lemma 5. Let $u_1, u_2, \dots, u_p$ be a (nonorthogonal) basis in L. Let $v_1, v_2, \dots, v_p$
be a (nonorthogonal) basis in M. Denote by U the matrix with the matrix elements
$\langle u_i, u_j \rangle$, denote by V the matrix with the matrix elements $\langle v_i, v_j \rangle$,
denote by W the matrix with the matrix elements $\langle u_i, v_j \rangle$. Then the numbers
$\lambda_j[L,M]^2$ coincide with the eigenvalues of the matrix

$$U^{-1} W V^{-1} W^t \qquad (1)$$

Proof. The statement is obvious. (For orthonormal bases, U = V = 1 and W = A, so matrix (1) is $A A^t$.)

The angles or stationary angles^4 (C. Jordan, 1875) $\psi_1 \le \psi_2 \le \dots \le \psi_p$
between the subspaces $L, M \in \mathrm{Gr}_{p,q}$ are defined by

$$\psi_1 = \Psi_1[L, M] := \arccos \lambda_1, \ \dots, \ \psi_p = \Psi_p[L, M] := \arccos \lambda_p$$

Obviously, $0 \le \psi_j \le \pi/2$. We also will use the notation

$$\Psi[L,M] = (\Psi_1[L,M], \dots, \Psi_p[L,M])$$

Remark. For all j we have $\Psi_j[L, M] = \Psi_j[M, L]$.

4 Other terms are complex distance or compound distance. Models of symmetric spaces
given in [13] and [15] show that these invariants for all classical Riemannian symmetric
spaces are really angles.
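The definition above can be tried numerically. The following is a minimal sketch for the case p = 2 only, assuming both subspaces are given by orthonormal bases; the function name `jordan_angles_2d` is hypothetical, and the closed-form eigenvalues of the 2 x 2 matrix $A^t A$ replace a general singular value routine.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def jordan_angles_2d(e, f):
    """Ascending Jordan angles between span(e) and span(f).

    e and f are orthonormal pairs of vectors (an assumption of this sketch).
    """
    # A = (<e_i, f_j>) is the 2 x 2 matrix of Section 2.
    a = [[dot(ei, fj) for fj in f] for ei in e]
    # The squared singular values of A are the eigenvalues of G = A^t A.
    g11 = a[0][0] ** 2 + a[1][0] ** 2
    g22 = a[0][1] ** 2 + a[1][1] ** 2
    g12 = a[0][0] * a[0][1] + a[1][0] * a[1][1]
    mean = (g11 + g22) / 2
    radius = math.hypot((g11 - g22) / 2, g12)
    # Clamp to [0, 1] to absorb rounding before taking sqrt and arccos.
    lam_sq = [min(1.0, max(0.0, x)) for x in (mean + radius, mean - radius)]
    # lambda_1 >= lambda_2, hence psi_1 <= psi_2.
    return [math.acos(math.sqrt(x)) for x in lam_sq]

# L = span(e_1, e_2) in R^4; M is obtained by tilting e_1 towards e_3 by 0.3
# and e_2 towards e_4 by 1.0, so the angles should be 0.3 and 1.0.
e = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
f = [[math.cos(0.3), 0.0, math.sin(0.3), 0.0],
     [0.0, math.cos(1.0), 0.0, math.sin(1.0)]]
print(jordan_angles_2d(e, f))
```

Swapping the arguments gives the same angles, in agreement with the Remark.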

3. The result of the paper. Denote by $W_p$ (Weyl group) the group of all
transformations of $\mathbb{R}^p$ generated by permutations of the coordinates and by the
transformations

$$(t_1, \dots, t_p) \mapsto (\sigma_1 t_1, \dots, \sigma_p t_p), \quad \text{where } \sigma_j = \pm 1$$

Theorem A. Let $\ell(x)$ be a $W_p$-invariant norm in $\mathbb{R}^p$. Then the function

$$d(L,M) := \ell(\Psi_1[L,M], \dots, \Psi_p[L,M])$$

is an $\mathrm{O}(p+q)$-invariant metric on $\mathrm{Gr}_{p,q}$.

Remark. The geodesic distance in $\mathrm{Gr}_{p,q}$ associated with the $\mathrm{O}(p) \times \mathrm{O}(q)$-invariant
Riemannian metric is given by the formula

$$\mathrm{dist}(L,M) = \sqrt{\Psi_1[L,M]^2 + \dots + \Psi_p[L,M]^2}$$

Theorem B. Let $L, M, N \in \mathrm{Gr}_{p,q}$. Let $\varphi_j = \Psi_j[L,M]$, $\psi_j = \Psi_j[M,N]$,
$\theta_j = \Psi_j[L,N]$ be the angles. Denote by Z the convex hull of the $W_p$-orbit of the
vector $(\psi_1, \dots, \psi_p) \in \mathbb{R}^p$. Denote by $\Pi$ the shift of Z by the vector $(\varphi_1, \dots, \varphi_p)$.

Then there exists a vector $(\theta_1^\circ, \dots, \theta_p^\circ) \in \Pi$ such that the collection of numbers
$(\cos \theta_1^\circ, \dots, \cos \theta_p^\circ)$ coincides up to permutation with the collection of numbers
$(\cos \theta_1, \dots, \cos \theta_p)$.
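Theorem B can be probed numerically in the smallest nontrivial case. The sketch below assumes p = q = 2 and random planes in $\mathbb{R}^4$; all function names are hypothetical. It also relies on a standard description of the hull for p = 2, which is not taken from the paper: Z is the set of x with $\max |x_i| \le \max \psi_i$ and $|x_1| + |x_2| \le \psi_1 + \psi_2$. Since the angles lie in $[0, \pi/2]$, the only candidates for $\theta^\circ$ are the signed permutations of $\theta$.

```python
import itertools
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def random_plane(rng, n=4):
    """Orthonormal basis of a random 2-dimensional subspace of R^n (Gram-Schmidt)."""
    u = [rng.gauss(0, 1) for _ in range(n)]
    w = [rng.gauss(0, 1) for _ in range(n)]
    nu = math.sqrt(dot(u, u))
    u = [x / nu for x in u]
    c = dot(w, u)
    w = [x - c * y for x, y in zip(w, u)]
    nw = math.sqrt(dot(w, w))
    return [u, [x / nw for x in w]]

def angles(e, f):
    """Ascending Jordan angles between two planes given by orthonormal bases."""
    a = [[dot(ei, fj) for fj in f] for ei in e]
    g11 = a[0][0] ** 2 + a[1][0] ** 2
    g22 = a[0][1] ** 2 + a[1][1] ** 2
    g12 = a[0][0] * a[0][1] + a[1][0] * a[1][1]
    mean, radius = (g11 + g22) / 2, math.hypot((g11 - g22) / 2, g12)
    return [math.acos(math.sqrt(min(1.0, max(0.0, x))))
            for x in (mean + radius, mean - radius)]

def in_hull(x, psi, tol=1e-7):
    """Membership in the convex hull of the W_2-orbit of psi (assumed description)."""
    ax = sorted((abs(t) for t in x), reverse=True)
    ap = sorted((abs(t) for t in psi), reverse=True)
    return ax[0] <= ap[0] + tol and ax[0] + ax[1] <= ap[0] + ap[1] + tol

def theorem_b_holds(L, M, N):
    phi, psi, theta = angles(L, M), angles(M, N), angles(L, N)
    for perm in itertools.permutations(theta):
        for signs in itertools.product((1, -1), repeat=2):
            # Each candidate has the same cosines as theta, up to permutation.
            cand = [s * t for s, t in zip(signs, perm)]
            if in_hull([c - f for c, f in zip(cand, phi)], psi):
                return True
    return False

rng = random.Random(0)
results = [theorem_b_holds(random_plane(rng), random_plane(rng), random_plane(rng))
           for _ in range(200)]
print(all(results))
```

This is only a sanity check on random samples, not a proof; the tolerance `tol` is an arbitrary choice made to absorb rounding in `acos`.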

4. Infinitesimal angular structure. For $L \in \mathrm{Gr}_{p,q}$ we denote by
$T_L(\mathrm{Gr}_{p,q})$ the tangent space to $\mathrm{Gr}_{p,q}$ at the point L. It is natural to identify
elements $\xi \in T_L(\mathrm{Gr}_{p,q})$ with operators H from L to the orthogonal complement
$L^\perp$. We denote by

$$\rho_1[L;H] \le \dots \le \rho_p[L;H]$$

the singular values of the operator $H : L \to L^\perp$.

Lemma 6. Let $L, L' \in \mathrm{Gr}_{p,q}$, $H \in T_L(\mathrm{Gr}_{p,q})$, $H' \in T_{L'}(\mathrm{Gr}_{p,q})$. The
following conditions are equivalent:

i) $\rho_j[L;H] = \rho_j[L';H']$ for all j;

ii) there exists an operator $g \in \mathrm{O}(p+q)$ such that $gL = L'$, $gH = H'$.

Proof. The statement is obvious.

Remark. The $\mathrm{O}(p+q)$-invariant Riemannian metric in $\mathrm{Gr}_{p,q}$ is

$$\mathrm{tr}\, H^t H = \sum_j \rho_j^2[L;H]$$

5. Relations between angles and infinitesimal angular structure.

The following statement is obvious. 

Proposition 7. Let M(t) be a smooth ($C^\infty$) curve in $\mathrm{Gr}_{p,q}$. Then

$$\lim_{\varepsilon \to +0} \frac{1}{\varepsilon} \Psi_j[M(a + \varepsilon), M(a)] = \rho_j[M(a); M'(a)]$$

where lim denotes the right limit at 0.



6. Infinitesimal variation of angles. Let $L, M \in \mathrm{Gr}_{p,q}$. Let $e_j \in L$,
$f_j \in M$ be orthonormal bases satisfying the conditions

$$\langle e_i, f_j \rangle = 0 \quad \text{if } i \ne j$$

$$\langle e_j, f_j \rangle = \cos \psi_j$$

Assume that the $\psi_j$ are pairwise different.

Consider an orthonormal basis $r_1, \dots, r_q \in M^\perp$ such that for all $j \le p$
the vectors $e_j$, $f_j$, $r_j$ span a 2-dimensional plane and $f_j$ is situated in the angle
between $e_j$ and $r_j$.

We have

$$e_j = f_j \cos \psi_j - r_j \sin \psi_j$$

Let $H : M \to M^\perp$ be a tangent vector to $\mathrm{Gr}_{p,q}$ at the point M. Let $h_{ij}$ be
the matrix elements of H in the bases $f_1, \dots, f_p$ and $r_1, \dots, r_q$.
Consider a $C^\infty$-smooth curve $M(\varepsilon) \in \mathrm{Gr}_{p,q}$ such that

$$M(0) = M; \qquad M'(0) = H$$
Proposition 8. Assume $\psi_1 \ne 0$, $\psi_p \ne \pi/2$, and $\psi_{j+1} \ne \psi_j$ for all j. Then

$$\frac{d}{d\varepsilon} \Psi_j[L, M(\varepsilon)] \Big|_{\varepsilon = 0} = h_{jj} \qquad (2)$$

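Proof. Denote by $f_j(\varepsilon)$ the unique vector in $M(\varepsilon)$ having the form

$$f_j(\varepsilon) = f_j + \sum_k a_{jk}(\varepsilon) r_k$$

Obviously,

$$a_{jk}(\varepsilon) = \varepsilon h_{jk} + O(\varepsilon^2)$$

For all $i, j \le p$ we have

$$\langle f_i(\varepsilon), f_j(\varepsilon) \rangle = \begin{cases} 1 + O(\varepsilon^2), & \text{if } i = j \\ O(\varepsilon^2), & \text{if } i \ne j \end{cases}$$

and

$$\langle e_i, f_j(\varepsilon) \rangle = \begin{cases} \cos \psi_i - \varepsilon h_{ii} \sin \psi_i + O(\varepsilon^2), & \text{if } i = j \\ -\varepsilon h_{ji} \sin \psi_i + O(\varepsilon^2), & \text{if } i \ne j \end{cases}$$

Now we are ready to write matrix (1) for the subspaces L, $M(\varepsilon)$:

$$\begin{pmatrix} \cos^2(\psi_1 + \varepsilon h_{11}) + O(\varepsilon^2) & O(\varepsilon) & \dots & O(\varepsilon) \\ O(\varepsilon) & \cos^2(\psi_2 + \varepsilon h_{22}) + O(\varepsilon^2) & \dots & O(\varepsilon) \\ \vdots & \vdots & \ddots & \vdots \\ O(\varepsilon) & O(\varepsilon) & \dots & \cos^2(\psi_p + \varepsilon h_{pp}) + O(\varepsilon^2) \end{pmatrix}$$

Since the numbers $\cos^2 \psi_j$ are pairwise different, the off-diagonal terms $O(\varepsilon)$
change the eigenvalues only by $O(\varepsilon^2)$. This implies the required statement.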
Remark. Let us fix $L \in \mathrm{Gr}_{p,q}$.

a) The set of all $M \in \mathrm{Gr}_{p,q}$ such that $\Psi_1[L, M] = 0$ has codimension $q - p + 1$.

b) The set of all $M \in \mathrm{Gr}_{p,q}$ such that $\Psi_p[L, M] = \pi/2$ has codimension 1.

c) The set of all $M \in \mathrm{Gr}_{p,q}$ such that $\Psi_{j+1}[L, M] = \Psi_j[L, M]$ has codimension 2.

7. Preliminaries. A p x p matrix A with nonnegative elements is called bistochastic if for all k, l

$$\sum_i a_{ik} = 1; \qquad \sum_j a_{lj} = 1$$

We say that a real p x p matrix A is quasistochastic if for all k, l

$$\sum_i |a_{ik}| \le 1; \qquad \sum_j |a_{lj}| \le 1$$



Proposition 9. (Birkhoff) The set of all bistochastic matrices is the convex
hull of the permutation matrices^5.

See [?].

Lemma 10. The set of all quasistochastic matrices is the convex hull of the
group $W_p$.

Proof. It is sufficient to describe the extremal points of the set of all
quasistochastic matrices.

a) Obviously, for any extremal point A and for all k, l

$$\sum_i |a_{ik}| = 1; \qquad \sum_j |a_{lj}| = 1 \qquad (3)$$

b) Let a matrix A satisfy condition (3). Assume that the matrix $(|a_{ij}|)$ is not
an extremal point of the set of bistochastic matrices. Then A is not an extremal
point of the set of quasistochastic matrices.

Hence any extremal point of the set of quasistochastic matrices is an element
of $W_p$.

5 I.e., matrices consisting of 0 and 1 and having exactly one 1 in each column and each row.

Example. Let $U = (u_{ij})$ and $V = (v_{ij})$ be matrices with norm $\le 1$. Then
the matrix $W = (u_{ij} v_{ij})$ is quasistochastic.
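The Example can be checked on a concrete pair of matrices. The sketch below assumes U and V are 3 x 3 rotation matrices (orthogonal, hence of norm 1), builds the elementwise product, and tests the row and column sums of absolute values; the helper names are hypothetical.

```python
import math

def rotation(axis, angle):
    """3 x 3 rotation about a coordinate axis (0, 1 or 2); an orthogonal matrix."""
    c, s = math.cos(angle), math.sin(angle)
    i, j = [k for k in range(3) if k != axis]
    m = [[1.0 if r == col else 0.0 for col in range(3)] for r in range(3)]
    m[i][i], m[i][j], m[j][i], m[j][j] = c, -s, s, c
    return m

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def is_quasistochastic(w, tol=1e-12):
    """All row sums and column sums of |w_ij| are at most 1 (up to rounding)."""
    rows = all(sum(abs(x) for x in row) <= 1 + tol for row in w)
    cols = all(sum(abs(w[i][j]) for i in range(len(w))) <= 1 + tol
               for j in range(len(w[0])))
    return rows and cols

u = matmul(rotation(0, 0.7), rotation(2, 1.9))
v = matmul(rotation(1, -0.4), rotation(0, 2.3))
# Elementwise product W = (u_ij * v_ij).
w = [[u[i][j] * v[i][j] for j in range(3)] for i in range(3)]
print(is_quasistochastic(w))
```

The check succeeds because each row (or column) sum of $|u_{ij} v_{ij}|$ is bounded, by the Cauchy-Schwarz inequality, by the product of the norms of the corresponding rows (or columns) of U and V.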

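Lemma 11.^6 Let A be a real p x q matrix. Let $\lambda = (\lambda_1, \dots, \lambda_p)$ be its
singular values. Then the convex hull of the $W_p$-orbit of $\lambda$ contains the vector
$(a_{11}, \dots, a_{pp})$.

Proof. Indeed, the matrix A can be represented in the form

$$A = U \begin{pmatrix} \lambda_1 & & & 0 & \dots & 0 \\ & \ddots & & \vdots & & \vdots \\ & & \lambda_p & 0 & \dots & 0 \end{pmatrix} V^t, \qquad U \in \mathrm{O}(p), \ V \in \mathrm{O}(q)$$

Hence

$$\begin{pmatrix} a_{11} \\ a_{22} \\ \vdots \\ a_{pp} \end{pmatrix} = \begin{pmatrix} u_{11} v_{11} & u_{12} v_{12} & \dots & u_{1p} v_{1p} \\ u_{21} v_{21} & u_{22} v_{22} & \dots & u_{2p} v_{2p} \\ \vdots & \vdots & \ddots & \vdots \\ u_{p1} v_{p1} & u_{p2} v_{p2} & \dots & u_{pp} v_{pp} \end{pmatrix} \begin{pmatrix} \lambda_1 \\ \lambda_2 \\ \vdots \\ \lambda_p \end{pmatrix}$$

The p x p matrix $(u_{ij} v_{ij})$ is quasistochastic (see the Example). Then we apply Lemma 10.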

8. Proof of Theorems A-B. Fix nonnegative numbers

$$a_1 \le \dots \le a_p$$

Fix an arbitrary orthonormal basis

$$e_1, e_2, \dots, e_p, \ f_1, f_2, \dots, f_p, \ r_1, \dots, r_{q-p} \in \mathbb{R}^{p+q}$$

Consider the subspace $L_a(s) \in \mathrm{Gr}_{p,q}$ spanned by the vectors $v_1(s), \dots, v_p(s)$
given by the formula

$$v_j(s) = \cos(a_j s) e_j + \sin(a_j s) f_j \qquad (4)$$

We obtain a curve $L_a(s)$ in $\mathrm{Gr}_{p,q}$.

6 This is a minor variation of the Ky Fan theorem, see [10].

We say that a curve $\gamma(t)$ in $\mathrm{Gr}_{p,q}$ is an H-curve if in some orthonormal basis
it has the form $L_a(s)$. We say that the numbers $a_j$ are the invariants of the
H-curve $\gamma$.

We say that points $L_a(s)$, $L_a(t)$ on an H-curve are sufficiently near if

$$a_p |s - t| < \pi/2$$

The following statements are obvious.

Lemma 12. Consider sufficiently near points $L(s_1)$, $L(s_2)$, $L(s_3)$ on an
H-curve. Assume $s_1 < s_2 < s_3$. Then for all j

$$\Psi_j[L(s_1), L(s_2)] + \Psi_j[L(s_2), L(s_3)] = \Psi_j[L(s_1), L(s_3)]$$
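Lemma 12 is easy to observe on formula (4). The sketch below assumes q = p and takes $e_j$, $f_j$ to be the standard basis vectors number j and p + j of $\mathbb{R}^{2p}$; the function names and the sample values of $a_j$ and $s_k$ are arbitrary choices made for illustration.

```python
import math

def h_curve_basis(a, s):
    """Vectors v_j(s) = cos(a_j s) e_j + sin(a_j s) f_j of formula (4) in R^{2p}."""
    p = len(a)
    basis = []
    for j, aj in enumerate(a):
        v = [0.0] * (2 * p)
        v[j] = math.cos(aj * s)        # component along e_j
        v[p + j] = math.sin(aj * s)    # component along f_j
        basis.append(v)
    return basis

def angles_on_curve(a, s, t):
    """Ascending Jordan angles between L_a(s) and L_a(t).

    The matrix <v_i(s), v_j(t)> is diagonal for this curve, so its singular
    values are the absolute values of the diagonal entries.
    """
    vs, vt = h_curve_basis(a, s), h_curve_basis(a, t)
    gram = [[sum(x * y for x, y in zip(u, w)) for w in vt] for u in vs]
    p = len(a)
    assert all(abs(gram[i][j]) < 1e-12 for i in range(p) for j in range(p) if i != j)
    return sorted(math.acos(min(1.0, abs(gram[j][j]))) for j in range(p))

a = [0.2, 0.5, 1.1]             # invariants a_1 <= a_2 <= a_3
s1, s2, s3 = 0.1, 0.6, 1.3      # a_3 * (s3 - s1) = 1.32 < pi/2: sufficiently near
lhs = [x + y for x, y in zip(angles_on_curve(a, s1, s2), angles_on_curve(a, s2, s3))]
rhs = angles_on_curve(a, s1, s3)
print(max(abs(x - y) for x, y in zip(lhs, rhs)))
```

For sufficiently near points the j-th angle between $L_a(s)$ and $L_a(t)$ equals $a_j |s - t|$, which is why the printed discrepancy is at the level of rounding error.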

Lemma 13. Let $L, M \in \mathrm{Gr}_{p,q}$ and $\Psi_p[L,M] < \pi/2$. Then there exists a
unique H-curve $\gamma(s)$ joining L, M such that L, M are sufficiently near points
on $\gamma(s)$.

Consider points $L, M, N \in \mathrm{Gr}_{p,q}$ in general position. Denote by $\theta_j$
the angles between M and N. Consider the H-curve $\gamma(t)$ such that $\gamma(0) = M$,
$\gamma(1) = N$ and M, N are sufficiently near points of the curve $\gamma(t)$. Then the
invariants of the H-curve $\gamma(s)$ are $\theta_1, \dots, \theta_p$. We assume that for each $s \in [0, 1]$

$$\Psi_1[L, \gamma(s)] \ne 0, \qquad \Psi_p[L, \gamma(s)] \ne \pi/2 \qquad (5)$$

$$\Psi_j[L, \gamma(s)] \ne \Psi_{j+1}[L, \gamma(s)] \quad \text{for all } j \qquad (6)$$

Denote by Z the convex hull of the $W_p$-orbit of the vector $\theta$.
By Proposition 8 and Lemma 11, we have

$$\Psi[L, \gamma(1/n)] \in \Psi[L, M] + \Big(\frac{1}{n} + O\Big(\frac{1}{n^2}\Big)\Big) Z, \qquad n \to \infty$$

In the same way,

$$\Psi[L, \gamma(2/n)] \in \Psi[L, \gamma(1/n)] + \Big(\frac{1}{n} + O\Big(\frac{1}{n^2}\Big)\Big) Z$$

$$\Psi[L, \gamma(3/n)] \in \Psi[L, \gamma(2/n)] + \Big(\frac{1}{n} + O\Big(\frac{1}{n^2}\Big)\Big) Z$$

and the $O(1/n^2)$ are uniform in $k/n$. Hence

$$\Psi[L, \gamma(t)] \in \Psi[L, M] + t\Big(1 + O\Big(\frac{1}{n}\Big)\Big) Z; \qquad n \to \infty$$

and hence

$$\Psi[L, \gamma(t)] \in \Psi[L, M] + tZ \qquad (7)$$

Assume that there exists a unique value s that does not satisfy the conditions
(5)-(6). Consider a small $\delta$. Then for $t > s$ we have

$$\Psi[L, \gamma(s - \delta)] \in \Psi[L, M] + (s - \delta) Z$$

$$\Psi[L, \gamma(s - \delta)] \ \text{is close to} \ \Psi[L, \gamma(s + \delta)]$$

$$\Psi[L, \gamma(t)] \in \Psi[L, \gamma(s + \delta)] + (t - s + \delta) Z$$

and we again obtain (7).

An H-curve in general position contains a finite number of points s that do not
satisfy conditions (5)-(6). Hence we can repeat our arguments.

This proves Theorem B.

Consider a $W_p$-invariant norm $\ell(\cdot)$ on $\mathbb{R}^p$. By Lemma 11 for any $x \in Z$

$$\ell(x) \le \ell(\theta)$$

and this finishes the proof of Theorem A.

Corollary 14. H-curves are geodesics in any metric having the form

$$d(L,M) = \ell(\Psi_1[L,M], \dots, \Psi_p[L,M]) \qquad (8)$$

Remark. If the sphere $\ell(x) = 1$ in $\mathbb{R}^p$ does not contain a segment, then this
geodesic is unique.

Corollary 15. Consider the Finsler metric on $\mathrm{Gr}_{p,q}$ given by the formula

$$F(L, H) = \ell(\rho_1[L, H], \dots, \rho_p[L, H]); \qquad L \in \mathrm{Gr}_{p,q}, \ H \in T_L(\mathrm{Gr}_{p,q})$$

Then the associated geodesic distance is given by the expression (8).

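9. Other symmetric spaces. In [?] it was explained that an arbitrary classical
compact symmetric space is a Grassmannian in a real, complex or quaternionic
linear space. This allows us to translate our results literally to all classical compact
Riemannian symmetric spaces

$$\mathrm{U}(n) \times \mathrm{U}(n)/\mathrm{U}(n); \quad \mathrm{U}(n)/\mathrm{O}(n); \quad \mathrm{U}(2n)/\mathrm{Sp}(n) \qquad (9)$$

$$\mathrm{U}(p+q)/\mathrm{U}(p) \times \mathrm{U}(q); \quad \mathrm{O}(2n)/\mathrm{U}(n); \quad \mathrm{O}(p+q)/\mathrm{O}(p) \times \mathrm{O}(q); \quad \mathrm{O}(n) \times \mathrm{O}(n)/\mathrm{O}(n);$$

$$\mathrm{Sp}(p+q)/\mathrm{Sp}(p) \times \mathrm{Sp}(q); \quad \mathrm{Sp}(n)/\mathrm{U}(n); \quad \mathrm{Sp}(n) \times \mathrm{Sp}(n)/\mathrm{Sp}(n)$$

The three series of type A (i.e. (9)) slightly differ from the others: we have to replace
the group $W_p$ by the symmetric group.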

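In the same way, all classical Riemannian noncompact symmetric spaces are
open domains in Grassmannians (see [12]). This allows us to extend our results to
all classical Riemannian noncompact symmetric spaces

$$\mathrm{GL}(n,\mathbb{C})/\mathrm{U}(n); \quad \mathrm{GL}(n,\mathbb{R})/\mathrm{O}(n); \quad \mathrm{GL}(n,\mathbb{H})/\mathrm{Sp}(n)$$

$$\mathrm{U}(p,q)/\mathrm{U}(p) \times \mathrm{U}(q); \quad \mathrm{SO}^*(2n)/\mathrm{U}(n); \quad \mathrm{O}(p,q)/\mathrm{O}(p) \times \mathrm{O}(q); \quad \mathrm{O}(n,\mathbb{C})/\mathrm{O}(n);$$

$$\mathrm{Sp}(p,q)/\mathrm{Sp}(p) \times \mathrm{Sp}(q); \quad \mathrm{Sp}(2n,\mathbb{R})/\mathrm{U}(n); \quad \mathrm{Sp}(n,\mathbb{C})/\mathrm{Sp}(n)$$

10. Some examples. a) The space $\mathrm{GL}(n,\mathbb{C})/\mathrm{U}(n)$. We realize points of
the space as positive definite n x n complex matrices. The group $\mathrm{GL}(n,\mathbb{C})$ acts
on this space by the transformations

$$L \mapsto g L g^*, \qquad g \in \mathrm{GL}(n,\mathbb{C})$$

The angles^7 $\Psi_j[L,M]$ between points L and M are the solutions $\psi$ of the
equation

$$\det(L - e^{\psi} M) = 0$$

Denote by $\Psi[L,M]$ the vector $(\Psi_1[L,M], \dots, \Psi_n[L,M])$.

Let L, M, N be points of our space. Consider all vectors in $\mathbb{R}^n$ that can
be obtained from $\Psi[M, N]$ by permutations of coordinates. Denote by Z their
convex hull. Then

$$\Psi[L,N] \in \Psi[L,M] + Z \qquad (10)$$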

b) Original Lidskii theorem. Consider the space $\mathcal{S}$ of Hermitian n x n matrices.
This space is also a (nonsemisimple) symmetric space. The group of isometries
is the group of transformations

$$X \mapsto U X U^* + A, \qquad \text{where } U \in \mathrm{U}(n), \ A \in \mathcal{S}$$

Let $X, Y \in \mathcal{S}$. The analogues of the angles are the eigenvalues of $X - Y$. The analogue
of Theorem B is the original Lidskii theorem [9].

The space $\mathcal{S}$ can be identified with the tangent space to $\mathrm{GL}(n,\mathbb{C})/\mathrm{U}(n)$ at the
point 1. For $X, Y \in \mathcal{S}$ we define matrices

$$L = 1 + \varepsilon X, \quad M = 1 + \varepsilon Y \in \mathrm{GL}(n,\mathbb{C})/\mathrm{U}(n)$$

where $\varepsilon$ is small. Then the angles between L and M have the form $\varepsilon \lambda_j + O(\varepsilon^2)$, where $\lambda_j$
are the eigenvalues of $X - Y$. Hence the inclusion (10) implies the Lidskii theorem.
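The Lidskii inclusion can be illustrated in the 2 x 2 case. The sketch below assumes real symmetric matrices (a special case of Hermitian ones) encoded as triples (a, b, d) for [[a, b], [b, d]], and uses the form "the ordered eigenvalues of X + Y minus the ordered eigenvalues of X lie in the convex hull of the permutations of the eigenvalues of Y". The function names are hypothetical.

```python
import math
import random

def eig_sym2(a, b, d):
    """Eigenvalues (descending) of the real symmetric matrix [[a, b], [b, d]]."""
    mean, radius = (a + d) / 2, math.hypot((a - d) / 2, b)
    return [mean + radius, mean - radius]

def lidskii_holds(x, y, tol=1e-9):
    """Check eig(X + Y) - eig(X) in conv{permutations of eig(Y)} for 2 x 2 matrices.

    For n = 2 the hull is the segment between (m1, m2) and (m2, m1): the
    coordinates sum to m1 + m2 and each coordinate lies in [m2, m1].
    """
    s = tuple(p + q for p, q in zip(x, y))
    diff = [u - v for u, v in zip(eig_sym2(*s), eig_sym2(*x))]
    m1, m2 = eig_sym2(*y)
    same_sum = abs(sum(diff) - (m1 + m2)) <= tol
    return same_sum and all(m2 - tol <= c <= m1 + tol for c in diff)

rng = random.Random(1)
samples = [(tuple(rng.gauss(0, 1) for _ in range(3)),
            tuple(rng.gauss(0, 1) for _ in range(3))) for _ in range(1000)]
print(all(lidskii_holds(x, y) for x, y in samples))
```

The equality of sums is just the additivity of the trace; the nontrivial part is the bound on each coordinate of the difference.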

The Lidskii theorem on singular values of a sum of two matrices corresponds to the
triangle inequality in a tangent space to $\mathrm{U}(p+q)/\mathrm{U}(p) \times \mathrm{U}(q)$ or
$\mathrm{U}(p,q)/\mathrm{U}(p) \times \mathrm{U}(q)$.

c) The space $\mathrm{Sp}(2n,\mathbb{R})/\mathrm{U}(n)$. This space can be realized as the space of
symmetric n x n complex matrices with norm < 1 (see for instance [14], 5.1, 6.3).
For two points T, S we define the expression

$$A[T, S] = (1 - TT^*)^{-1/2} (1 - TS^*) (1 - SS^*)^{-1/2} \qquad (11)$$

Let $\lambda_j$ be its singular values. Then the hyperbolic angles between T and S are
given by the formula

$$\Psi_j[T, S] = \mathrm{arcosh}\, \lambda_j \qquad (12)$$

The analogue of Theorem B is given by the formula (10).

7 hyperbolic angles

d) Arazy norms. Denote by $V_{fin}$ the space of finite real sequences
$x = (x_1, \dots, x_N, 0, 0, \dots)$. Consider a norm $\ell$ on the space $V_{fin}$ satisfying the
conditions

- $\ell$ is invariant with respect to permutations of coordinates

- $\ell$ is invariant with respect to the transformations $(x_1, x_2, \dots) \mapsto (\sigma_1 x_1, \sigma_2 x_2, \dots)$,
where $\sigma_j = \pm 1$

- if $x^{(j)}$ converges to x coordinate-wise and $\ell(x^{(j)})$ converges to $\ell(x)$, then
$\ell(x^{(j)} - x)$ converges to 0 (an equivalent formulation: the $\ell$-convergence on the
sphere $\ell(x) = 1$ is equivalent to the coordinate-wise convergence).

Let $V_\ell$ be the completion of $V_{fin}$ with respect to the norm $\ell$.

A compact operator A in a Hilbert space is an element of the Arazy class $C_\ell$ (see
[?]) if the sequence of its singular values is an element of $V_\ell$. Consider the space
$B_\ell$ (operator ball) of all compact operators T in the Hilbert space satisfying the
conditions

- $T \in C_\ell$

- $\|T\| < 1$, where $\|T\|$ denotes the standard norm of an operator in a Hilbert space

- $T = T^t$

We define the angles $\Psi_j(T,S)$ in $B_\ell$ by formulas (11)-(12). We define the
distance in $B_\ell$ by

$$d_\ell(T,S) = \ell(\Psi_1(T,S), \Psi_2(T,S), \dots)$$

Proposition 16. a) $d_\ell(T,S)$ is a metric.

b) The space $B_\ell$ is complete with respect to the metric $d_\ell(T,S)$.

The statement a) can be easily obtained from Theorem A by a limit consideration.
For a proof of the statement b) see [14], 8.6.3.

11. Some references. a) For matrix inequalities see for instance [?].

b) The formula for the distance in a symmetric space associated with the
invariant Riemannian metric was obtained in [?].

c) Our Theorem B for the unitary group $\mathrm{U}(n) = \mathrm{U}(n) \times \mathrm{U}(n)/\mathrm{U}(n)$ is the
Nudelman-Shvartsman theorem.

d) A generalization of the Ky Fan theorem to arbitrary simple Lie algebras was
obtained in [?].

e) Let G be a simple Lie group, K be its maximal compact subgroup and
$K \backslash G / K$ be the hypergroup of K-biinvariant subsets in G with convolution
product. The problem of the triangle inequality in Grassmannians is related to
a classical problem on the structure of the hypergroup $K \backslash G / K$, see for instance
[?], [?], [?].

f) Some nonstandard geometries on groups are discussed in [?].

g) Some applications of the geometry of angles are contained in [?], 6.3.

h) Conjecture. I think that the complete triangle inequality for angles coincides
with the Horn-Klyachko inequalities [?], see also [11] and some comments in